Two-microphone voice activity detection in the presence of coherent interference
Authors
Abstract
In this paper, we propose a two-microphone Voice Activity Detection (VAD) method for use in the presence of coherent interference. The proposed method is based on the Cross Power Spectrum Phase (CPSP), which is an implementation of the Phase Transform (PHAT) weighted cross correlation between two microphones. The PHAT weighting whitens the spectrum of the input signals and makes the cross correlation depend entirely on the phase of the cross spectrum. If we assume that the direction of the desired speech signal is known and the time delay between the microphones is compensated, the Averaged CPSP (A-CPSP) can be utilized as a VAD measure. In order to enhance the VAD performance in the presence of strong coherent interference from other directions, we propose a Maximum Partially Averaged Real CPSP (MPA-RCPSP) method, which detects the cophased frequency region with a high Signal-to-Interference Ratio (SIR). Simulation results demonstrate that the proposed MPA-RCPSP is a more reliable measure than the conventional A-CPSP in the presence of strong coherent interference.
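The measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`dft`, `cpsp`, `a_cpsp`, `mpa_rcpsp`) and the `band_width` parameter are hypothetical, and splitting the spectrum into fixed, non-overlapping sub-bands is an assumption, since the abstract does not say how the partial averages are formed. The sketch assumes the two frames are already time-aligned toward the desired speaker.

```python
import cmath
import math


def dft(frame):
    """Naive O(N^2) DFT of a real frame, returning bins 0..N/2."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n // 2 + 1)
    ]


def cpsp(frame1, frame2, eps=1e-12):
    """PHAT-normalised cross spectrum: one (near) unit-magnitude phasor per bin."""
    cross = [a * b.conjugate() for a, b in zip(dft(frame1), dft(frame2))]
    # Dividing by the magnitude whitens the spectrum so only phase remains;
    # eps avoids division by zero in empty bins.
    return [c / max(abs(c), eps) for c in cross]


def a_cpsp(frame1, frame2):
    """Real CPSP averaged over all bins (zero-lag PHAT-weighted correlation)."""
    phasors = cpsp(frame1, frame2)
    return sum(p.real for p in phasors) / len(phasors)


def mpa_rcpsp(frame1, frame2, band_width=8):
    """Maximum over sub-bands of the band-averaged real CPSP.

    Assumption: fixed, non-overlapping sub-bands of `band_width` bins;
    leftover bins at the top of the spectrum are ignored.
    """
    real = [p.real for p in cpsp(frame1, frame2)]
    starts = range(0, len(real) - band_width + 1, band_width)
    return max(sum(real[i:i + band_width]) / band_width for i in starts)
```

For two identical frames every bin is cophased, so both measures are close to 1. When one frame is delayed relative to the other, the full-band average drops, while the sub-band maximum stays high as long as some frequency region remains nearly in phase. A practical version would replace the naive `dft` with an FFT routine and compare the measure against a threshold to make the speech/non-speech decision.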
Similar resources
Multi-Channel Voice Activity Detection Based on Conic Constraints
Unlike single microphone techniques for voice activity detection (VAD), multi-microphone signal processing usually exploits the spatial information of signals received at multiple microphones. In this paper, we propose a VAD algorithm based on conic constraints to achieve robustness against the direction of arrival (DOA) estimation error. The proposed algorithm uses the phase vector as feature ...
Voice activity detection using the phase vector in microphone array
If the desired speech source is located at a different position from the interference, it is possible to exploit spatial selectivity for reliable speech detection. In this paper, we propose a voice activity detector (VAD) for a microphone array system, using spatial information obtained by the eigendecomposition of the multi-channel correlation matrix. We use the phase vector as a measure for VAD, which is...
A Dual-Microphone Speech Enhancement Algorithm for Close-Talk System
While human listening is robust in complex auditory scenes, current speech enhancement algorithms do not perform well in noisy environments, even when a close-talk system is used. This paper addresses robustness in a dual-microphone embedded close-talk system by employing a computational auditory scene analysis (CASA) framework. The energy difference between the two microphones is used as the primar...
Acquiring molecular interference functions of X-ray coherent scattering for breast tissues by combination of simulation and experimental methods
Background: Recently, it has been indicated that X-ray coherent scatter from biological tissues can be used to access the signature of a tissue. Some scientists are interested in studying this effect for the early detection of breast cancer. Since experimental methods for optimization are time-consuming and expensive, some scientists suggest using simulation. Monte Carlo (MC) codes are the best...
A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, many of the non-speech signals may be classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
Journal title:
Volume, issue
Pages -
Publication date: 2006